BACKGROUND

Field of the Invention 

The present invention relates to computer security and intrusion detection
systems. More specifically, the present invention relates to a method and an
apparatus for providing content-based intrusion detection using an agile
kernel-based auditor.

Related Art 

As computers become increasingly interconnected, it is becoming
progressively harder to safeguard computer systems from attacks launched across
computer networks. Several types of attacks, such as buffer overflow attacks and
attacks that make unauthorized modifications to data objects, can be detected by
examining data that is being read from and/or written to security-critical files
or network connections.

Unfortunately, existing intrusion detection systems cannot reliably detect
these types of attacks because they do not possess the ability to examine data that
is being read or written during system calls.

For example, an existing auditing system may record system call
parameters or attributes of subjects and objects involved in the system calls.
However, existing auditing systems do not record data that is being read from or
written to files or network connections because the volume of data that is read or
written is prohibitively large.

Some network sniffers can collect data being read from and/or written to
files across a network. However, network sniffers cannot gather information
regarding accesses to local files. Furthermore, network sniffers can suffer
performance and packet-loss problems if they try to collect this type of data
because, as mentioned previously, the volume is prohibitively large. Also,
encryption is increasingly being used to protect the privacy of data transmitted
across networks. Consequently, network sniffers will eventually be unable to
obtain useful audit data.

Hence, what is needed is a method and an apparatus for monitoring system
calls that gathers read and/or write data for intrusion detection purposes without
encountering problems in handling large volumes of data.

Another problem is that existing auditing systems are not configured to
collect information for specific intrusion detection systems. Existing auditing
systems are typically developed by operating system developers, who do not
necessarily know what types of data are required by intrusion detection systems.
Consequently, existing auditing systems are not configured to gather parameters
and/or other attributes that are required by an intrusion detection system.
Furthermore, an intrusion detection system may require different types of data to
be gathered at different times.

Hence, what is needed is a method and an apparatus that can be configured 
to selectively gather specific system call information for an intrusion detection 
system. 


SUMMARY 

One embodiment of the present invention provides content-based intrusion
detection for a computer system by using an agile kernel-based auditing system.
This auditing system operates by receiving an audit specification that specifies
target attributes to be recorded during an auditing process. The audit specification
also specifies an auditing criterion that triggers recording of the target attributes.
Upon receiving the audit specification, the auditing system is configured to record
the target attributes during system calls whenever the auditing criterion is
satisfied. Next, an application program is monitored by the auditing system to
produce an audit log containing the recorded target attributes. This audit log is
examined in order to detect patterns for intrusion detection purposes.

In one embodiment of the present invention, configuring the auditing
system involves compiling the audit specification to produce a kernel module, and
then loading the kernel module into a kernel of an operating system. It also
involves linking code from within the kernel module into system calls within the
operating system.

In one embodiment of the present invention, in response to detecting an
event during the auditing process, the system dynamically adjusts the auditing
system to change the auditing criterion and/or the target attributes for subsequent
operation of the auditing system.

In one embodiment of the present invention, the auditing system is
configured to modify a system call jump table to cause selected system calls to
execute code that causes the target attributes to be recorded in response to the
auditing criterion being satisfied.

In one embodiment of the present invention, the target attributes can
include: an argument from a system call; a parameter of a process making the
system call; data read during the system call; data written during the system call; a
parameter of a file involved in the system call; and a parameter relating to a
network communication involved in the system call.

In one embodiment of the present invention, the auditing criterion can
include: a user identifier for a process that is making a system call; an identifier
for an application program from which the system call is being made; and an
identifier for a file being accessed by the system call.

In one embodiment of the present invention, producing the audit log
involves filtering the target attributes to reduce an amount of data stored in the
audit log.

In one embodiment of the present invention, producing the audit log
involves determining a characteristic of a target attribute, and recording the
characteristic in the audit log.

In one embodiment of the present invention, the audit specification is 
received from either a user of the auditing system, or an intrusion detection 
mechanism. 



BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 illustrates a computer system in accordance with an embodiment of
the present invention.

FIG. 2 illustrates the process of configuring an auditing system in
accordance with an embodiment of the present invention.

FIG. 3 illustrates how a system call jump table is modified in accordance
with an embodiment of the present invention.

FIG. 4 is a flow chart illustrating the process of configuring and running
the auditing system in accordance with an embodiment of the present invention.

DETAILED DESCRIPTION

The following description is presented to enable any person skilled in the
art to make and use the invention, and is provided in the context of a particular
application and its requirements. Various modifications to the disclosed
embodiments will be readily apparent to those skilled in the art, and the general
principles defined herein may be applied to other embodiments and applications
without departing from the spirit and scope of the present invention. Thus, the
present invention is not intended to be limited to the embodiments shown, but is
to be accorded the widest scope consistent with the principles and features
disclosed herein.

The data structures and code described in this detailed description are
typically stored on a computer readable storage medium, which may be any device
or medium that can store code and/or data for use by a computer system. This
includes, but is not limited to, magnetic and optical storage devices such as disk
drives, magnetic tape, CDs (compact discs) and DVDs (digital video discs), and
computer instruction signals embodied in a transmission medium (with or without
a carrier wave upon which the signals are modulated). For example, the
transmission medium may include a communications network, such as the
Internet.

Computer System 

FIG. 1 illustrates a computer system 102 in accordance with an 
embodiment of the present invention. Computer system 102 can generally include 
any type of computer system, including, but not limited to, a computer system 
based on a microprocessor, a mainframe computer, a digital signal processor, a 
personal organizer, a device controller, and a computational engine within an 
appliance. 

Computer system 102 is coupled to database 104. Database 104 can 
generally include any type of system for storing data in non-volatile storage. This 
includes, but is not limited to, systems based upon magnetic, optical, and 
magneto-optical storage devices, as well as storage devices based on flash 
memory and/or battery-backed up memory. Database 104 contains audit log 105 
for recording auditing information for intrusion detection purposes in accordance 
with an embodiment of the present invention. 

Computer system 102 is also coupled to remote computer system 118
through network 116. Network 116 can include any type of wired or wireless
communication channel capable of coupling together computing nodes. This
includes, but is not limited to, a local area network, a wide area network, or a
combination of networks. In one embodiment of the present invention, network
116 includes the Internet.

Remote computer system 118 can include any entity that is capable of
transmitting suspect code 108 across network 116 into computer system 102.

Suspect code 108 may additionally be introduced into computer system
102 by encoding suspect code 108 on a computer-readable storage medium, such
as disk 120, and introducing disk 120 into computer system 102. Note that disk
120 can generally include any type of computer-readable storage medium, such as
a magnetic disk, a magnetic tape and a CD-ROM.

Also note that suspect code 108 may also be introduced into computer
system 102 through other communications mechanisms.

During operation, computer system 102 executes suspect code 108 as well
as intrusion detection system (IDS) 106. During execution, suspect code 108
makes a number of system calls through system call interface 112. These system
calls are intercepted by agile auditor 110, which causes target attributes from the
system calls to be recorded within audit log 105 upon detection of an auditing
criterion.

Agile auditor 110 can generally include any type of mechanism for
auditing system calls generated by suspect code 108. Note that agile auditor 110
makes use of loadable kernel module 122, which contains code that records
specified attributes for specific system calls upon the occurrence of specific
auditing criteria.

Agile auditor 110 in turn makes system calls through real system call
interface 114 to access system call code 115. Note that real system call interface
114 is a pre-existing system call interface for operating system 113. Agile auditor
110 and system call interface 112 are layered on top of real system call interface
114 in order to intercept system calls generated by suspect code 108.

Process of Configuring Auditing System 

FIG. 2 illustrates the process of configuring an auditing system in
accordance with an embodiment of the present invention. The system starts with
an audit specification 202 that specifies the attributes to be recorded for
specific system calls upon the occurrence of specific auditing criteria.

Audit specification 202 feeds through a special audit specification
compiler 204, which converts audit specification 202 into auditing code that
implements audit specification 202. This auditing code is packaged into
loadable kernel module 122, which is loaded into operating system 113 within
computer system 102.
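
By way of illustration only, the following Python sketch models an audit
specification as a small data structure and "compiles" it into a callable hook.
It is a user-space analogy rather than kernel code. The names AuditSpec,
compile_spec and hook, and the dictionary used to represent a system call, are
hypothetical and do not appear in this specification.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

# Hypothetical representation of one system call event (an assumption).
Event = Dict[str, Any]


@dataclass(frozen=True)
class AuditSpec:
    """Hypothetical audit specification: which call, when, and what to record."""
    syscall: str                         # system call to instrument
    criterion: Callable[[Event], bool]   # auditing criterion
    targets: List[str]                   # target attributes to record


def compile_spec(spec: AuditSpec, log: List[Event]) -> Callable[[Event], None]:
    """Turn a specification into auditing code (stands in for compiler 204)."""
    def hook(event: Event) -> None:
        if event["syscall"] == spec.syscall and spec.criterion(event):
            # Record only the requested attributes that are present.
            log.append({k: event[k] for k in spec.targets if k in event})
    return hook


audit_log: List[Event] = []
spec = AuditSpec(
    syscall="write",
    criterion=lambda e: e.get("path") == "/etc/passwd",
    targets=["uid", "path", "data"],
)
hook = compile_spec(spec, audit_log)
hook({"syscall": "write", "path": "/etc/passwd", "uid": 0, "data": b"x", "pid": 7})
hook({"syscall": "write", "path": "/tmp/scratch", "uid": 0, "data": b"y", "pid": 7})
```

In this sketch only the first call satisfies the criterion, so only its
requested attributes reach the log, and the unrequested "pid" field is dropped.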

System Call Jump Table 

FIG. 3 illustrates how system call jump table 302 is modified in 
accordance with an embodiment of the present invention. System call jump table 
302 includes a number of entries that specify the location of corresponding system 
call functions. For example, entry 304 within system call jump table 302 would 
normally point to real system call code 308. However, during the process of 
linking loadable kernel module 122 into operating system 113, entry 304 is 
modified to point to code 306 within loadable kernel module 122. 
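
The redirection of a jump table entry can be pictured with the following
Python sketch, which uses a dictionary in place of system call jump table 302.
This is an analogy under stated assumptions, not an implementation of any
particular kernel: install_hook, real_write and the restore step at the end are
hypothetical names and behavior introduced for the example.

```python
from typing import Callable, Dict, List

audit_log: List[str] = []


def real_write(path: str, data: bytes) -> int:
    """Stands in for real system call code 308."""
    return len(data)


# Stands in for system call jump table 302: name -> handler.
jump_table: Dict[str, Callable[[str, bytes], int]] = {"write": real_write}


def install_hook(table: Dict[str, Callable[[str, bytes], int]],
                 name: str) -> Callable[[str, bytes], int]:
    """Repoint one table entry at auditing code and return the original handler."""
    original = table[name]

    def audited(path: str, data: bytes) -> int:   # plays the role of code 306
        audit_log.append(f"{name} {path} {len(data)} bytes")
        return original(path, data)               # fall through to the real call

    table[name] = audited
    return original


saved = install_hook(jump_table, "write")
result = jump_table["write"]("/etc/passwd", b"root:x:0:0")
# Assumption: an unload step might restore the original entry like this.
jump_table["write"] = saved
```

Callers continue to look up "write" in the table as before, so the
interposition requires no change to the calling code.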

Code 306 first records a target attribute if a specific auditing criterion is 
satisfied. For example, upon detecting a write to a password file, code 306 may 
record all data that is written to the password file. 

Note that the target attribute can generally include any information related
to the system call, including an argument of the system call, a parameter related to
a process making the system call (such as a process ID, an effective user ID, a user
ID, a group ID, an effective group ID, a parent process ID, a session ID and a
pathname for the process), data read during the system call, data written during
the system call, a parameter related to a file involved in the system call (such as a
permission mode, an inode number, a device ID, a time of creation, an owner user
ID and a file type) or a parameter related to a network communication involved in
the system call (such as an IP address or port number).

Also note that the auditing criterion can generally include any specifier for
a condition associated with a system call, including a user identifier for a process
that is making the system call, an identifier for an application program from which
the system call is being made or an identifier for a file being accessed by the
system call. Note that the condition is satisfied if a currently used identifier
matches the specified identifier. For example, if the identifier specifies a
password file, the condition is satisfied whenever the password file is being
accessed.
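
The matching rule can be sketched in Python as follows. The Criterion class
and is_satisfied function are hypothetical. Treating an unspecified identifier
as "not checked" and requiring all specified identifiers to match are
assumptions of this sketch, since the description does not state how multiple
identifiers combine.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Criterion:
    """Hypothetical auditing criterion; a field left as None is not checked."""
    uid: Optional[int] = None
    program: Optional[str] = None
    path: Optional[str] = None


def is_satisfied(c: Criterion, uid: int, program: str, path: str) -> bool:
    """True when every specified identifier equals the one currently in use."""
    return (
        (c.uid is None or c.uid == uid)
        and (c.program is None or c.program == program)
        and (c.path is None or c.path == path)
    )


watch_passwd = Criterion(path="/etc/passwd")
hit = is_satisfied(watch_passwd, uid=1000, program="/usr/bin/vi", path="/etc/passwd")
miss = is_satisfied(watch_passwd, uid=1000, program="/usr/bin/vi", path="/tmp/notes")
```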

Next, code 306 calls the real underlying system call through real system
call interface 114.

After the real system call returns, code 306 can record another target 
attribute in response to detecting another auditing criterion. This capability is 
useful for recording the result of the real system call. 
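
The three-part sequence performed by code 306 (record on entry, call the real
system call, record on return) can be sketched in Python as shown below. The
function names, the canned data returned by real_read, and the pre and post
predicates are hypothetical and serve only to make the example runnable.

```python
from typing import Any, Callable, Dict, List

audit_log: List[Dict[str, Any]] = []


def real_read(path: str, size: int) -> bytes:
    """Stands in for the real system call; returns canned data for the example."""
    return b"root:x:0:0:root:/root:/bin/sh"[:size]


def audited_read(path: str, size: int,
                 pre: Callable[[str], bool],
                 post: Callable[[bytes], bool]) -> bytes:
    """Sketch of code 306: record, call the real system call, record again."""
    if pre(path):                                  # first auditing criterion
        audit_log.append({"phase": "entry", "path": path, "size": size})
    result = real_read(path, size)                 # real underlying call
    if post(result):                               # second auditing criterion
        audit_log.append({"phase": "return", "path": path, "data": result})
    return result


data = audited_read(
    "/etc/passwd", 6,
    pre=lambda p: p == "/etc/passwd",
    post=lambda r: len(r) > 0,
)
```

Recording after the call returns is what makes the data actually read, or the
return status, available to the audit log.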


Process of Configuring and Running Auditing System 

FIG. 4 is a flow chart illustrating the process of configuring and running
the auditing system in accordance with an embodiment of the present invention.
The system starts by receiving audit specification 202 (step 402). In one
embodiment of the present invention, audit specification 202 is received from
either a human user of the auditing system, or from an intrusion detection
mechanism that automatically generates audit specification 202.

Audit specification 202 is compiled using audit specification compiler 204
to produce loadable kernel module 122 (step 404). Next, loadable kernel module
122 is inserted into the kernel of operating system 113 (step 406).

This loading process involves modifying system call jump table 302 (from 
FIG. 3) so that code 306 is accessed during a reference to a specified system call 
(step 408). This causes the specified system call to record the target attribute if a 
specified auditing criterion is satisfied. 

Next, suspect code 108 is executed. This causes agile auditor 110 to
record specified target attributes during specified system calls to audit log 105 
(step 410). Note that producing audit log 105 can involve filtering the target 
attribute to reduce an amount of data stored in audit log 105. This filtering may 
also involve determining a characteristic of the target attribute and storing the 
characteristic instead of the target attribute. For example, the auditing system may 
determine that data read during a system call is binary executable code. In this 
case, the characteristic "binary" can then be stored in audit log 105 instead of 
storing the binary executable code itself. 
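
A minimal Python sketch of this kind of filtering follows. The heuristic used
to label data as "binary" (an ELF magic number or an embedded NUL byte) and the
truncation of text data are assumptions of the sketch, as are the names
characterize and log_entry. The description does not prescribe how the
characteristic is determined.

```python
ELF_MAGIC = b"\x7fELF"   # leading bytes of an ELF executable


def characterize(data: bytes) -> str:
    """Return a short label for the data (a deliberately crude heuristic)."""
    if data.startswith(ELF_MAGIC) or b"\x00" in data:
        return "binary"
    return "text"


def log_entry(syscall: str, data: bytes, max_text: int = 32) -> dict:
    """Store a characteristic for binary data and a truncated copy for text."""
    kind = characterize(data)
    entry = {"syscall": syscall, "kind": kind, "length": len(data)}
    if kind == "text":
        entry["data"] = data[:max_text]   # assumed cap on stored text
    return entry


binary_entry = log_entry("read", ELF_MAGIC + b"\x02\x01\x01" + bytes(64))
text_entry = log_entry("read", b"hello world\n")
```

Here the entry for the executable carries only its label and length, so the
size of the log entry does not grow with the size of the data read.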

Next, the system examines audit log 105 for intrusion detection purposes 
(step 412). Note that in general any type of intrusion detection mechanism can be 
used with the present invention. Hence, the details of the intrusion detection 
mechanism will not be discussed further in this specification. 

Also note that the present invention can be dynamically configured to
gather specific information for specific intrusion detection mechanisms. Upon
detecting an event during the auditing process (step 414), the system can
dynamically adjust itself in response to the event (step 416). For example, upon
detecting retrieval of data from a remote server, the system can record all reads
and writes involving the process that retrieved the data.

Note that by selectively recording target attributes, the present invention
can reduce the amount of data that is recorded during the auditing process. This
makes it practical to record data that is read or written during system calls without
overwhelming the storage capacity, processing power and/or data transfer
bandwidth of a computer system.
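
The dynamic adjustment of steps 414 and 416 can be sketched in Python as
follows, using the remote-retrieval example given above. The AgileAuditor class,
the "recv" event name and the "remote" flag are hypothetical. The sketch simply
widens the auditing criterion to cover a process once that process has been
seen retrieving remote data.

```python
from typing import Any, Dict, List, Set

Event = Dict[str, Any]


class AgileAuditor:
    """Toy auditor whose criterion (a set of watched PIDs) changes at run time."""

    def __init__(self) -> None:
        self.watched_pids: Set[int] = set()
        self.log: List[Event] = []

    def on_syscall(self, event: Event) -> None:
        # Triggering event (assumed): a process receives data from a remote host.
        if event["syscall"] == "recv" and event.get("remote", False):
            self.watched_pids.add(event["pid"])
        # Adjusted criterion: record every read/write by a watched process.
        if event["syscall"] in ("read", "write") and event["pid"] in self.watched_pids:
            self.log.append(event)


auditor = AgileAuditor()
auditor.on_syscall({"syscall": "write", "pid": 42, "data": b"before"})
auditor.on_syscall({"syscall": "recv", "pid": 42, "remote": True})
auditor.on_syscall({"syscall": "write", "pid": 42, "data": b"after"})
auditor.on_syscall({"syscall": "read", "pid": 99, "data": b"other"})
```

Only the write made by process 42 after the retrieval is recorded, which
reflects how selective recording keeps the volume of audit data manageable.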

The foregoing descriptions of embodiments of the invention have been 
presented for purposes of illustration and description only. They are not intended 
to be exhaustive or to limit the present invention to the forms disclosed. 
Accordingly, many modifications and variations will be apparent to practitioners 
skilled in the art. Additionally, the above disclosure is not intended to limit the 
present invention. The scope of the present invention is defined by the appended 
claims. 